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Abstract 

We initiate the study of an interesting aspect of sponsored search advertising, namely the conse- 
quences of broad match- a feature where an ad of an advertiser can be mapped to a broader range of 
relevant queries, and not necessarily to the particular keyword(s) that ad is associated with. In spite of its 
unanimously believed importance, this aspect has not been formally studied yet, perhaps because of the 
inherent difficulty involved in formulating a tractable framework that can yield meaningful conclusions. 
In this paper, we provide a natural and reasonable framework that allows us to make definite statements 
about the economic outcomes of broad match. 

Starting with a very natural setting for strategies available to the advertisers, and via a careful look 
through the algorithmic lens, we first propose solution concepts for the game originating from the strate- 
gic behavior of advertisers as they try to optimize their budget allocation across various keywords. 

Next, we consider two broad match scenarios based on factors such as information asymmetry be- 
tween advertisers and the auctioneer (i.e. the search engine company), and the extent of auctioneer's 
control on the budget splitting. In the first scenario, the advertisers have the full information about broad 
match and relevant parameters, and can reapportion their own budgets to utilize the extra information; 
in particular, the auctioneer has no direct control over budget splitting. We show that, the same broad 
match may lead to different equilibria, one leading to a revenue improvement, whereas another to a rev- 
enue loss. This leaves the auctioneer in a dilemma - whether to broad-match or not, and consequently 
leaves him with a computational problem of predicting which broad matches will provably lead to rev- 
enue improvement. This motivates us to consider another broad match scenario, where the advertisers 
have information only about the current scenario, i.e., without any broad-match, and the allocation of 
the budgets unspent in the current scenario is in the control of the auctioneer. Perhaps not surprisingly, 
we observe that if the quality of broad match is good, the auctioneer can always improve his revenue by 
judiciously using broad match. Thus, information seems to be a double-edged sword for the auctioneer. 
Further, we also discuss the effect of both broad match scenarios on social welfare. 



1 Introduction and Motivation 

The importance of understanding the various aspects of sponsored search advertising(SSA) is now well 
known. Indeed, this advertising framework has been studied extensively in recent years from algorithmic El 
H [ED ED, game-theoretic||5l|22j[Tl|9j|3j@l[ni. learning-theoretic ff\ |24l [T9l perspectives, as well as, from 
the viewpoint of emerging diversification in the internet economy ll20l I2T1 . Specifically, among others, these 
studies include important aspects such as (i) the design of mechanisms for optimizing the revenue of the 
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auctioneer, (ii) budget optimization problem of an advertiser, (iii) analyzing the bidding behavior of adver- 
tisers in the auction of a keyword query, (iv) learning the Click-Through-Rates and (v) the role of for-profit 
mediators. In the present paper, our goal is to initiate the study of yet another interesting aspect of SSA, 
namely the consequences of broad match. Despite its unanimously believed importance, to the best of our 
knowledge, this aspect has not been formally studied yet, probably because of the hardness in employing a 
proper framework for such an study. Our main aim in this paper is to attempt to provide such a framework. 
In the rest part of this section, we start by an informal introduction to broad match, then while establishing 
the need of a framework to study broad match we provide a glimpse of the present work. 

1.1 What is Broad Match? 

In SSA format, each advertiser has a set of keywords relevant to her products and a daily budget that she 
wants to spend on these keywords. Further, for each of these keywords she has a true value associated with 
it that she derives when a user clicks on her ad corresponding to that keyword, and based on this true value 
she reports her bid to the search engine company (i.e. the auctioneer) to indicate the maximum amount she 
is willing to pay for a click. When a user queries for a keyword, the auctioneer runs an auction among all the 
advertisers interested in that keyword whose budget is not over yet. The advertisers winning in this auction 
are allocated an ad slot each, determined according to the auction's allocation rule and an advertiser is 
charged, whenever the user clicks on her ad, an amount determined according to the auction's payment rule. 
In this way, for each of the advertisers, each of her ads is matched to queries of the particular keyword(s) 
that ad is associated with. 

Broad Match is a feature where an ad of an advertiser can be mapped to a broader range of relevant 
queries, and not necessarily to the particular keyword(s) that ad is associated with. Such relevant queries 
could be possible variations of the associated keyword or could even correspond to a completely different 
keyword which is conceptually related to the associated keyword. For example, the variations of keyword 
"Scuba" could include "Scuba diving", "Scuba gear", "Scuba shops in Los Angeles" etc and conceptually 
related keywords could include "Snorkeling", "under water photography" etc. Similarly, for the keyword 
"internet advertising", the variations could include "banner advertising", "PPC advertising", "advertising 
on the internet", "keyword advertising", "online advertising" etc, and conceptually related keywords could 
include "adword", "adsense" etc; for the keyword "horse race", the variations could include "horse rac- 
ing tickets", "horse race betting", "online horse racing" etc, and a conceptually related keyword could be 
"thoroughbred". 

1.2 The Need of a Framework to Study Broad Match 

To study the effect of incorporating broad match on macroscopic quantities such as revenue of the auctioneer, 
social value etc, compared to the scenario without broad match, we must consider the interaction among 
various keywords. We must take into account the changes in the bidding behavior of advertisers for specific 
keyword queries, as well as, the effect of changes in their budget allocation across various keywords. As 
we mentioned earlier, the incentive properties of query specific keyword auction is well analyzed in articles 
such asO HU [U |9j [2 |H [TT|. Further, the budget optimization problem of an advertiser, that is to spend 
the budget across various keywords in an optimal manner given quantities such as keyword specific cost- 
per-clicks, expected number of clicks, and payoffs as a function of bid, has also been studied under various 
models (H [T3l [TH. However, the incentive constraints originating from such budget optimizing strategic 
behavior of advertisers has not been formally studied yet. In particular, there is no proposed solution concept 
pertinent for predicting the stable behavior of advertisers in this game[2J. On the other hand, for the analysis 
of the effect of incorporating broad match, it becomes inevitable to have such a solution concept, that 
is a notion of equilibrium behavior, under which we can compare the quantities such as revenue of the 
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auctioneer, social value etc, for the two scenarios one with the broad match and the one without it at their 
respective equilibrium points. 

1.3 Our Results 

Our first goal in this paper is to attempt to provide a reasonable solution concept for the game originating 
from the strategic behavior of advertisers as they try to optimize their budget allocation/splitting across 
various keywords. We realize that without some reasonable restrictions on the set of available strategies to 
the advertisers, it is a much harder task to achieve0. To this end, we consider a very natural setting for 
available strategies- (i) first split/allocate the budget across various keywords and then (ii) play the keyword 
query specific bidding/auction game as long as you have budget left over for that keyword when that query 
arrives -thereby dividing the overall game in to two stages. In the spirit of O EZH wherein the query specific 
keyword auction is modeled as a static one shot game of complete information despite its repeated nature in 
practice, we model the budget splitting too as a static one shot game of complete information because if the 
budget splitting and bidding process ever stabilize, advertisers will be playing static best responses to their 
competitors' strategies. 

Now, with this one shot complete information game modeling, the most natural solution concept to con- 
sider is pure Nash equilibrium. However, in our case, it seems to be a very strong notion of equilibrium 
behavior if we look through our algorithmic glassed because, as we argue in the paper, an advertiser's 
problem of choosing her best response is computationally hard. Consequently, we first consider a weaker 
solution concept based on local Nash equilibrium. We show that there is a strongly polynomial time (poly- 
nomial in number of advertisers, keywords and ad slots and not the volume of queries or total daily budget) 
algorithm to compute an advertiser's locally best response. This notion of equilibrium, which we call broad 
match equilibrium (BME), is defined in terms of marginal payoff (or bang-per-buck) for various keywords 
corresponding to a budget splitting, and looks similar to the definition of user equilibrium/Waldrop equilib- 
rium in routing and transportation science literature lfT7l . Further, there is a strongly poly-time algorithm to 
compute an advertiser's approximate best response as well. Therefore, the approximate Nash equilibrium 
(e-NE) is also reasonable in our setting. Indeed, all the conclusions of our work hold true irrespective of 
which of these two solution concepts we adopt, the BME or the e-NE. 

In the full information setting, under the solution concept of BME we obtain several observations by 
explicitly constructing examples. In particular, even with this natural and reasonably restricted notion of 
stable behavior, we observe that same broad match might lead to different BMEs where one BME might 
lead to an improvement in the revenue of the auctioneer over the scenario without broad match while the 
other to a loss in revenue, even when the quality of broad-match is very good. This leaves the auctioneer in 
a dilemma about whether he should broad-match or not. If he could somehow predict which choice of broad 
match lead to a revenue improvement for him and which not, he could potentially choose the ones leading to 
a revenue improvement. Further, the same examples imply the same conclusions under the solution concept 
of e-NE. This brings forth one of the big questions left open in this paper, that is of efficiently computing a 
BME I e-NE, if one exists, given a choice of broad match. 

Note that in the above scenario of broad match, the advertisers had all the relevant information about the 
broad match, and the control of budget splitting is in the hands of respective advertisers. We also explore 
another broad match scenario where the advertisers have information only about the scenario without broad- 
match, and they allow the auctioneer to spend their excess budgets (i.e the budget unspent in the scenario 
without broad match), in whatever way the auctioneer wants, in the hope of potential improvements in their 
payoffs. Perhaps not surprisingly, we observe that if the quality of broad match is good, the auctioneer 

1 Efficient computability is an important modeling prerequisite for solution concepts. In the words of Kamal Jain, "If your laptop 
cannot find it, neither can the market" (14) ■ 
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can always improve his revenue. Thus, information seems to be a double-edged sword for the auctioneer. 
Further, we also discuss the effect of both broad match scenarios on social welfare. 

1.4 Organization of the Paper 

Rest of the paper is organized as follows. In Section [2l we describe the formal setting of the query specific 
keyword auctions. In Section[3l we define Broad Match Graph, a weighted bipartite graph between the set of 
advertisers and the set of keywords, which serves as the basic back bone in terms of which we formulate all 
our notions and results. In Section|4l we present the defining characteristics of the two broad match scenarios 
studied in this paper, in terms of factors such as information asymmetry and the extent of auctioneer's control 
on the budget splitting. The Section[5]is devoted to the study of broad match scenario in the full information 
setting where the auctioneer has no direct control on the budget splitting. This is essentially the most 
contributing part of this paper wherein first we appropriately model the game originating from the strategic 
behavior of advertisers as they try to optimize their budget allocation/splitting across various keywords, 
then we propose appropriate solution concepts for this game, and finally study the effect of broad match 
under this proposed framework. In Section [6j we study the other broad match scenario, where advertisers 
do not have full information about the broad match being performed and the auctioneer partially controls 
the budget splitting. Finally, in Section [7J we conclude with potential directions for future work motivated 
by the present paper. 

2 Keyword Auctions 

There are K slots to be allocated among N (> K) bidders (i.e. the advertisers). A bidder i has a true 
valuation Vi (known only to the bidder i) for the specific keyword and she bids 6j. The expected click 
through rate (CTR) of an ad put by bidder i when allocated slot j has the form CTRij = jjei i.e. separable 
in to a position effect and an advertiser effect, 77's can be interpreted as the probability that an ad will be 
noticed when put in slot j and it is assumed that 7^ > "fj+i for all 1 < j < K and 7, = for j > K. ei 
can be interpreted as the probability that an ad put by bidder i will be clicked on if noticed and is referred to 
as the relevance of bidder i. The payoff/utility of bidder i when given slot j at a price of p per-click is given 
by ej7j(fj — p) and they are assumed to be rational agents trying to maximize their payoffs. 

As of now, Google as well as Yahoo! use schemes closely modeled as RBR(rank by revenue) with 
GSP(generalized second pricing). The bidders are ranked in the decreasing order of e^fej and the slots are 
allocated as per this ranks. For simplicity of notation, assume that the ith bidder is the one allocated slot i 
according to this ranking rule, then i is charged an amount equal to e ' +lb ' +1 per-click. This mechanism has 
been extensively studied in recent years (5J|9j|22j[TTl. The solution concept that is widely adopted to study 
this auction game is a refinement of Nash equilibrium independently proposed by Varian |[22l and Edelman et 
al[5 ]. Under this refinement, the bidders have no incentive to change to another positions even at the current 
price paid by the bidders currently at that position. Edelmen et al Q calls it locally envy-free equilibria and 
argue that such an equilibrium arises if agents are raising their bids to increase the payments of those above 
them, a practice which is believed to be common in actual keyword auctions. Varian[22 ] called it symmetric 
Nash equilibria(SNE) and provided some empirical evidence that the Google bid data agrees well with the 
SNE bid profile. In particular, an SNE bid profile b\ 's satisfy 

(li ~ 7i+lH+l e i+l + 7i+l e i+2&i+2 < nei+ih+i 

< (7i - j i+ i)viei + j i+1 e i+2 b i+ 2 (1) 

for alH = 1, 2, . . . , N. Now, recall that in the RBR with GSP mechanism, the bidder i pays an amount 
e i+1 b i+1 p er _ c jj c j C) therefore the expected payment i makes per-impression is 7iej e ' + * fe ' +1 = 7jej + i&j + i. 
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Thus the best SNE bid profile for advertisers (worst for the auctioneer) is minimum bid profile possible 
according to Equation Q] and is given by 



A 



E(7i ~ 7j+i)«j+i^+i 



(2) 



j=i 



and therefore, the revenue of the auctioneer at this minimum SNE is 



K 

y^7i e i+l^+l 



E£(7j - 7i+l)«j+iej+i 



A A" 



(3) 



i=l 



i=l j=i 



K 



5^(7i - 7j+i)iVj+i e j+i- 



8=1 



3 Broad Match Graph 



In this section, we set up a basic backbone to study the dynamics of bidding across various related keywords, 
in the sponsored search advertising, via a bipartite graph between the set of bidders (i.e. the advertisers) and 
the set of keywords with the parameters (N, M, K,S = {sij), B = (B{), V = (Vj)) as follows. 

• Keywords: M is the number of keywords in consideration. We will denote the set of keywords 
{1,2,..., M} by M. Further, Vj is the total (expected) volume of queries for the keyword j for a given 
period of time which we call a "day". These may not be all the keywords a particular advertiser may 
be interested in bidding on but one of the sets of keywords which are related by some category /concept 
that the advertiser is interested in bidding on e.g. for selling the same product or service. 

• Bidders/Advertisers: N is the total number of bidders interested in the above M keywords. We will 
denote the set of bidders {1,2,..., N} by TsT. Further, Bi is the total budget of the bidder i for a "day" 
that she wants to spend on these keywords. 

• Slots: K is the maximum number of ad-slots available. Let L be the number of advertisers with 
sufficient budgets when a particular query of a keyword j arrives then for that query of j, the slot- 
clickability (i.e. the position based CTRs) are defined to be 7; corresponding to a slot I for I < 
min{K, L} and zero otherwisd^l 

• Valuation Matrix: Let Vij be the true value of the bidder i for the keyword j and e^j be her rele- 
vance 1 (quality score) for j. We call the NxM matrix S, with (i, j)th entries defined as s-ij = Vijeij, 
as the valuation matrix. For a keyword j for which a bidder i has no interest or is not allowed to bid, 
Sij := —00. In the revenue/efficiency/payoffs calculations at SNE, it is enough to know Sij and not 
the Vij and e^j separately, therefore, in this paper we will often refer only to Sij and not Vij and eij 
individually. 

• Broad Match Graph (BMG): Given instance parameters (N, M, K, S, B, V), a bipartite graph G = 
(N, M,£), with vertex sets Ji and M and edge set £ = : i £ N,j S M, Sjj > 0}, is 
constructed. Further, each edge G £ is associated with a weight Sij. The weight of a node 
i G TsT is Bi and that of a node j G M is Vj. Furthermore, an i G K will be called an ad-node and an 
j G M will be referred to as a keyword-node. 

2 We assume that the clicks-through-rates(CTRs) are separable. 
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• Extension of a BMG: A BMG G' = (7f, M, £') for the instance (JV, M, K, S',B, V) is called an 
extension of a BMG G = (N, M, £) for the instance (N, M, K, S, B, V) if £ C £' and s[ j = s itj for 
all (i, j) 6 £. Interpretation is that the instance represented by an extension is more broader in the 
sense that an advertiser has a choice to be and could be matched to a larger set of keywords. Indeed, 
we will formally refer an extension G = (N, M, £ ) to be a broad-match for its base G = (N, M, £). 
Note that, without loss of generality, this definition of extension (broad-match) captures the possibility 
that new keywords are introduced. This is because we can always think that a new keyword to be 
included was already there as an isolated node and now we are creating only edges. 

4 Information Asymmetry, Budget Splitting and Two Broad Match Scenar- 
ios 

Depending on how much information the auctioneer (i.e. the search engine company) provides to advertisers 
about various parameters, such as CTRs, conversion rates, volume of queries etc, related to the keywords 
and depending on who splits the daily budget across the various keywords, the auctioneer or the advertisers, 
we consider the following two scenarios for broad matching: 

I) AdBM: Advertisers Controlled Broad Match, extra information to advertisers, advertisers split their 
budgets. 

II) AcBM : Auctioneer Controlled Broad Match, no extra information to advertisers, auctioneer splits 
the budget. 

Formally, let G' = (N, M, £ ) be the broad-match for G = (N, M, £), then we characterize these two 
scenarios of broad-match based on the following factors(ref. Tabled]): 

• Information asymmetry: Of course, advertisers already have all the information about the BMG G. 
That is, they all know their CTRs, conversion rates, volume of queries, who participates how long, 
spends and bids how much on which keyword, across various keywords they are bidding currently. 
It is the knowledge about these quantities for the part of the extension G which is not in G i.e. 
along the edges in £ \ £ that advertisers may or may not be aware of depending on whether the 
auctioneer provides this information to them or no|E In AcBM scenario, advertisers do not have this 
information. At most what they know is that there is some broad-match the auctioneer is performing 
and they can effect the dynamics on the extension G only passively through their bidding behavior on 
G0. In AdBM scenario, there is no such asymmetry in information. Advertisers know (or are rather 
informed by the auctioneer) all the information about £ \ £. 

• Extent of auctioneer's control on the budget splitting: In AdBM scenario, the auctioneer has no 
control over the budget splitting. It is upto the advertisers to decide which keywords they want to 
participate in and how much budget they want to spend on each of those keywords. Thus, the budget 
splitting of an advertiser, across various keywords she is connected to in G , is in her own control 
and she can split her budget so as to maximize her total payoff across those keywords. As discussed 
in section 11.21 this brings in another layer of incentive constraints from advertisers besides choosing 

3 Of course, to learn these relevance scores there might be a cost incurred by the auctioneer in the short run, nevertheless if the 
broad-match quality is good, this cost will generally be minimal. 

4 As we know, in the sponsored search advertising, advertisers effectively derive their valuations from the rate of conversion 
which might very well change by broad match and accordingly bidders may adjust their true values (and consequently the bids) 
due to broad match. This might lead to another level of cost to the auctioneer due to uncertainty in performing broad-match. 
Nevertheless, if the broad-match quality is good, this cost will generally be minimal. 
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AdBM 


AcBM 


Information asymmetry 


No 

(advertisers know all info on new edges) 


Yes 

(advertisers don't know info on new edges) 


Extent of auctioneer's control on the budget splitting 


No Control 


Limited Control 
(only for the excess budgets) 


An advertiser 's starting time for a keyword 


Very First Query 


Any Query 
(only for the part auctioneer controls) 



Table 1: Two Broad-Match Scenarios 



the bid values for individual queries. In AcBM scenario, the auctioneer does have some control over 
budget splitting. At one extreme, an advertiser might give the complete control to the auctioneer for 
splitting her budget across various keywords letting the auctioneer decide how much should be spent 
on which keyword. In this case, since the control will be completely in the hands of the auctioneer it 
can perform the budget splitting so as to maximize his own total revenue without much concern to the 
welfare of advertisers. Therefore, advertisers would hardly give such a full control to the auctioneer. 
However, it is reasonable that an advertiser allows the auctioneer to spend her excess budget i.e. the 
budget unspent in the case of G, in whatever way the auctioneer wants to spend it in G , in hope of an 
additional payoff. Note that there must be some advertisers with excess budget for the AcBM to make 
sense. If every advertiser is already spending its all budget then why to take all the pain of doing a 
broad-matcl0. In our study of AcBM in Section [6l we consider this limited control setting. Another 
issue the auctioneer faces in AcBM is that what bid profiles to use along the edges in £ \ £. The 
auctioneer must perform these calculations in a manner to maintain equilibrium across the keywords 
meaning that the advertisers should not be indirectly compelled to revise their true values along the 
edges in G 4 . This assumption is reasonable as at SNE the auctioneer can estimate the true values from 
the equilibrium bids on G and then along with the new information gathered for £ \ £, can do the 
proper SNE bids calculations on the behalf of the advertisers. 

• An advertiser's starting time for a keyword: In AdBM scenario, an advertisers participates in all 
keywords she is interested in starting with its first query until her budget allocated for that keyword is 
spent or there are no more queries for that keyword.]. In AcBM scenario, however, since the auctioneer 
is controlling at least a part of the budget splitting, and unlike advertisers, since it can easily track what 
happens at which query in an online manner, it can choose to bring in an advertiser's budget along an 
edge starting with any particular query of that keyword. For example, it can bring advertiser i in the 
auction of keyword j starting at say 1000th query of j. 

Now, let us illustrate the above two scenarios via an example by considering the BMG given in Figure Q] 
(i.e. the graph without the edge (3, 1)) and its extension given in the same figure (i.e. the graph including the 
edge (3, 1)). Further, each query is sold via a GSP auction (ref. Section [2]), and the revenues are calculated 
at SNE (ref. Equations 12 [3] in Section [2]). In the base BMG, under GSP the advertiser 2 pays zero amount 
for each query since there is no bidder ranked below her, therefore even with a very small budget she is 
able to participate in all the queries of keyword 1. Thus, the total revenue extracted in the base BMG is 

5 In case, doing a broad-match encourages advertisers to increase their budgets, for the purpose of analysis, this increase can be 
considered as an excess budget. 

6 In reality, Google/Yahoo! roughly allows the advertisers to specify which part of the day they want to spend most of their 
budgets, nevertheless this option does not give a finer control such as specifying a particular query number they can start with. 
Therefore, to simplify the incentive analysis we do not consider such option in this paper. Further, note that if the advertisers are 
given a chance to express their desire as to which part of the day they want to spend how much budget, and as long as the day 
is divided in to few parts (say polynomially many in the size of the BMG), then this expressiveness can be easily captured in the 
present framework by replicating the role of relevant keyword nodes and dividing the total volume of queries for that keyword node 
among these new nodes according to the size of the various parts of the day. 
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0.9Vi + O.6V2, 0.9Vi from keyword 1 and O.6V2 from the keyword 2. Now in the extension, there is a 
new edge (3, 1). In the AdBM scenario, since the advertiser 3 has the control of splitting the budget and 
whether she wants to participate for keyword 1, she participates along (3, 1) for all queries as she never 
needs to pay anything but certainly gets the second slot and positive payoffs for all the queries after the 
advertiser 2 spends its all budget and drops out i.e. for all the queries after the first eVith query. Note that 
advertiser 2 now pays a positive amount for each query and is forced to drop as its total budget gets spent. 
Therefore, the new revenue of the auctioneer is 0.9Vi + 3.1Vi (e — j^f ) + O.6V2 which is smaller than the 
revenue generated in the base BMG if e < M. However, in the AcBM scenario, the control is in the hands of 
auctioneer and he can choose not to spend the (excess) budget of advertiser 3 along (3, 1), thereby avoiding 
a potential revenue loss. Moreover, he has a finer control and can choose to bring 3 along (3, 1) after some 
queries for 1 has already arrived. In particular, if he brings 3 along (3, 1) starting (1 — e)V\ + 1th query, the 
new revenue generated is (1 — e)0.9Vi + 2.3eV\ + lAeVi + O.6V2 which is more than the revenue generated 
in the base BMG. 

Advertisers Keywords 




Figure 1: A BMG (without the edge (3,1) ) and an extension (with the edge (3,1)), K = 2, 71 = 1, 72 = 0.7 



5 Advertisers Controlled Broad Match (AdBM) 

In this section, we start out with our first goal, that is to provide a reasonable solution concept for the game 
originating from strategic behavior of advertisers as they try to optimize their budget allocation/splitting 
across various keywords. We realize that without some reasonable restrictions on the set of available strate- 
gies to the advertisers, it is a much harder task to achieved!. To this end, we consider a very natural setting 
for available strategies- (i) first split/allocate the budget across various keywords and then (ii) play the key- 
word query specific bidding/auction game as long as you have budget left over for that keyword when that 
query arrives -thereby dividing the overall game in to two stages. 

The second stage is exactly the query specific keyword auction and for that stage we can utilize the equi- 
librium behavior proposed in literature. Therefore, in the second stage, we are restricting the behavior of 
advertisers in that, given the availability of budget to participate in the auction of a particular query, an adver- 
tiser acts rationally but being ignorant about and disregarding the fact that her bidding behavior might effect 
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her decision in the splitting/allocation of her budget across various keywords in the first stage. For example, 
in the second stage, for the auction mechanisms currently used by Google and Yahoo! i.e. Generalized 
Second Price (GSP) Mechanism, we can adopt the solution concept of Symmetric Nash Equilibrium(SNE) 
proposed in EE]. That is, once the budget is split, for each query all advertisers, with available budgets 
for the corresponding keyword, bid according to a minimum SNE bid profile. Note that, even for the same 
keyword, different queries may have different SNE bid profiles because the set of advertisers with available 
budgets may be different when those queries arrive. 

Now, for the first stage i.e. budget splitting, we do not put any restriction on how the advertisers split 
their budget across various keywords. Each advertiser, knowing the fact that all advertisers will behave 
according to SNE for keyword queries, chooses her budget splitting so as to maximize her total payoff 
across all queries of all keywords for the day. Thus, with the restriction on the bidding behavior in second 
stage, we are left only to analyze the game of splitting the budget across various keywords. 

In the spirit of E3 wherein the query specific keyword auction is modeled as a static one shot game 
of complete information despite its repeated nature in practice, we model the budget splitting too as a static 
one shot game of complete information because if the budget splitting and bidding process ever stabilize, 
advertisers will be playing static best responses to their competitors' strategies. Let us refer to this game 
as Broad-Match Game. In the following, we search for a reasonable notion of stable budget splitting i.e. a 
reasonable solution concept for the Broad-Match Game. 

5.1 Advertisers' Best Response Problem and Search for an Appropriate Solution Concept 
for Broad-Match Game 

Let the budget of advertiser i decided for the keyword j be Bij, then this budget allows her to participate 
for the first Vij number of queries for some Vij < Vj. And given a number of queries for j, there will 
be a budget requirement from i, depending on the bidding interests of the other bidders, their valuations etc, 
to participate in the first Vij queries of j. Thus, there is a one-to-one correspondence between the budget 
spent on a particular keyword by a bidder and the total number of queries of that keyword stalling the first 
one, that the bidder participates in. Clearly, then the splitting of budget across various keywords can be 
equivalently considered as deciding on how many queries to participate in, starting their first queries, for 
those keywords. 

Now, with the one shot complete information game modeling, the most natural solution concept to 
consider is pure Nash equilibrium which in our scenario (i.e. for the Broad-Match Game correspond- 
ing to the instance (N, M, K, S, B, V) & BMG (N, M, £)) can be defined as the matrix of query values 
{Vij, G £} such that for each i, given that these values are fixed for all I 6 X — {i}, no other values 
of {K.j} gives abetter total payoff to the bidder i. Equivalently, it is the matrix of values {Bij, € £} 
such that for each i, given that the these values are fixed for all Z € N — {i}, no other values of {Bij} gives 
a better total payoff to the bidder i. 

The solution concept of pure Nash equilibrium that we have proposed above requires that the players 
(i.e. the advertisers) are powerful enough to play the game rationally. In particular, they should be able 
to compute the best responses to their competitors' strategies. In the present case, for an advertiser i, it 
boils down to solving an optimization problem of finding the budget-splitting across various keywords with 
maximum total payoff, given the budget- splitting of other advertisers. That is, given the values {B n j} for 
all re € N — {i} and for all j € M, to compute {Bij} yielding maximum payoff for the advertiser i. Of 
course, the details of BMG is known to every player. To justify this solution concept on algorithmic grounds, 
it should be essential that this optimization problem of advertiser i, henceforth referred to as Advertisers' 
Best Response Problem(AdBRP), should be solvable in time polynomial in N,M,K. Note that we are 
strictly asking the time complexity to be polynomial in N, M, K and not in Vj's and -Bj's. This is because 
Vj's and Bi's could in general be exponentially larger than N, M, K. In fact, in practice this seem to be the 
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case as volume of queries could be much larger than the number of advertisers and keywords. Therefore, 
let us first formulate this optimization problem for advertiser i. Let c(m,j,l) be the (expected) cost of 
advertiser m for the Ith query of the keyword j if she participates in the query I and all the previous queries 
of the keyword j. Similarly, u(m,j,l) is the (expected) payoff of advertiser m for the Zth query of the 
keyword j if she participates in the query I and all the previous queries of the keyword j. Also define the 
marginal payoff or bang-per-buck for advertiser m for the Ith query of keyword j as n(m, j, I) = "(m,'j'l) • 
To be able to compute her best response, an advertiser must be able to efficiently compute these values 
first. For each G £ there are 0(Vj) such values and at first glance it seems that the advertiser might 
take a time polynomial in Vj, i.e. not in strongly polynomial time, to compute these values. However, if 
we observe carefully there is a lot of redundancy in these values as noted in the following lemma. The 
idea is that the cost and the payoff of an advertiser for a query of a keyword depends only on the set of 
advertisers participating in that query and the valuations their of. Therefore, as long as this set is the same 
these quantities will remain the same. 

Lemma 1 For each i£X given the BMG G = (N, M, £) and the budget splitting of all advertisers in N— 
{i} (i.e. the B n j values for all r6N,j G MJ, there exist non-negative integers Aj, Zj^, Zji, Zj^, • • • , z j,Aj 
for all j G {j G M : (i, j) G £} such that Aj = O(N) and for all < A < Aj - 1, 

c{i,j,l) = C(i,j,X),u(i,j,l) =U(i,j,X),Tr(i,j,l) =U(i,j,X) Vzj t \ < I < Zj t \ +1 , 

where C(i,j,X) := c(i,j,z jt \ + 1), U(i,j, A) := u(i,j,z jt \ + l),U(i,j, A) := ir(i,j,Zj t \ + 1). Moreover, 
all these A, z, c, u, ir values can be computed together in 0(MN 2 K) time. 

Proof: The proof follows from Algorithm 15.11 described in the following. The idea in the Algorithm 
15.11 is that the cost and the payoff of an advertiser for a query of a keyword depends only on the set of 
advertisers participating in that query and the valuations their of. Therefore, as long as this set is the same 
these quantities will remain the same. Further, there can be at most choices of this set. For the very first 
query, this set consists of all the advertisers with positive budget for that keyword along with the advertiser i 
(recall that the budget splittings for other advertisers are given). This set changes when advertisers drop out 
as their budgets get spent and they no longer have enough amount to buy the next query. This change can 
occur at most N — 1 times. The Algorithm 15.11 tracks exactly when the advertisers drop out and compute 
the c, u, tt values accordingly. 

The Algorithm 15. 1 1 runs in time 0(N 2 K). Initialization requires O(N) and the while loop requires 
0(NK) in each iteration. There are at most 0(N) iteration of the while loop because for every two itera- 
tion at least one advertiser drops out. There is a NlogN term, subsumed by N 2 , that comes from the need 
to sort the valuation Sjj's before computing the SNE bid profile. 
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Algorithm 5.1: Query Partition(G, j, B n Jin G 3sf — {i}) 

i^{neN-{i}: B nJ > 0} 
A <- 

Zjfl <- 

while A / (f> 

'Compute C(n,j,X) Vn £ A U {i} 
and £7 (f, j, A) 

HC(i,j,X)=0 
then II(i, j, A) <— oo 

y^min„ eA {Lc^A)J} 
if y > 



do < 



then < 



A 1 <-A-{neA: 7 $fa=y} 
B n ,j <- 5 n ,j - yC{n,j, A) VnGi 
«j,A+i ^min{z jiA + y,^} 

A^ A + l 
then exit 

comment: Exiting the while LOOP 



else 



U M» e A : LsgftjJ ^ 0} 



if A 



then < 



fC(i,.7,A)<-0 
U(i,j,\) <- 71 Si j 
n(i,j, A) <- oo 
A <- A+ 1 



A, -A 



output (A,-, {z iiA }, {C(i,j, A)}, A)}, {n(i, j, A)}) 
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Now, using the above lemma we are ready to formulate the AdBRP of advertiser i. Let us define, 

A-l 

U(i,j,l) = (I - z jt x-i)U(i, j, A - 1) + ^ ( z j,m - Zj, m -x)U(i, j, m - 1) if Zj,a-i < I < zj,\- (4) 

m=l 
A-l 

C(i,j,l) = (I - Zj,\-i)C(i,j, A - 1) + (zj tm - Zj, m -i)C(i,j, m - 1) if Zj t \-i < I < Zj 7 \. (5) 

m— 1 

then the AdBRP is the following optimization problem in non-negative integer variables Xi j's. 

Max ^ U(i,j,Xij) 

j.t C(i,j,Xij)<Bi (6) 

(<J)ee 

0<sij<^ V(i,j) g£ 
V (i, j) g £ 

It is not hard to see that being a variant of Knapsack Problem (decision version of) AdBRP is NP-hard. 
Its NP hardness follows from simple restrictions that makes it equivalent to 0- 1 Knapsack Problem or Inte- 
ger Knapsack Problem(IKP) respectively. First, let us restrict all relevant quantities such as budget, utilities 
and costs to be integer values. Now, in AdBRP, choosing Vj = 1 for all j G M makes it equivalent to the 
0-1 Knapsack Problem. Also, in AdBRP, choosing Aj = 1 for all j £ M (i.e. a single partition for each 
keyword) makes it equivalent to the Integer Knapsack Problem. 

Now being NP-hard, AdBRP is unlikely to have an efficient algorithm and thus the solution concept 
based on pure Nash equilibrium does not seem to be reasonable and we should consider weaker notions. One 
such reasonable solution concept could be that based on local Nash equilibrium, where the advertisers are 
not required to be so sophisticated and can deviate only locally i.e. by small amounts from the their current 
strategies. We show that an advertiser's locally best response can be computed via a greedy algorithm 
in strongly polynomial time. Further, this equilibrium notion is similar to the user equilibrium/Waldrop 
equilibrium in routing and transportation science literature ifTTl . Another solution concept that we explore 
is motivated by the fact that being a variant of IKP and given that IKP can be approximated well, it may be 
possible to efficiently compute a pretty good approximation of the advertiser's best response, and therefore 
an approximate Nash equilibrium may also make sense. In the following sub-sections we investigate these 
two solution concepts. 

5.2 Broad Match Equilibrium(BME) 

Based on our discussion in section 15.11 let us first formally define the equilibrium notion based on local 
Nash equilibrium and let us refer to it as Broad Match Equilibrium(BME). 

Definition 2 Given a BMG G = (N, M, £.), a BME for G is defined as the matrix of query values {V$ j} 
and the budget splitting values {Bij} iff they satisfy the following conditions. 

El) For all i, if G £ with Vij > 0, and (i, I) G £ which is not query-saturated meaning Vij < Vj 
then MPJ~, := ^4ttHt > MP^~, := u j l ' l ,'Y?' l t]\ i.e. the advertiser i does not have an incentive to 
deviate locally from keyword j to keyword I. 
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E2) For all i, i spends her total budget Bi (on some keyword or the other) unless each £ £ is either 
query -saturated (i.e. V{ 7 j = Vj) or budget-saturated meaning that the left over budget of advertiser i 
for keyword j is insufficient to buy the next query of j. 

Now given the {V^j} and {B n j} values for all n G N — {i}, the locally best response problem ( local 
AdBRP) for % is to compute a set of {V^j} and {Bi t j} values such that they satisfy all conditions in the above 
definition of BME. We show in the following theorem that the local AdBRP can be computed efficiently. 
Thus, BME is indeed a reasonable solution concept for the Broad-Match Game. 

The idea in the proof of Theorem [3] is to first partition the queries across various keywords via Lemma 
Q]so that the payoff, cost, and marginal payoff of i is the same for all queries in a given partition. Then, in a 
GREEDY ALLOCATION PHASE, to greedily distribute the budgets across various keywords moving from 
one partition to another. Finally, in a GREEDY READJUSTMENT PHASE, if there is an edge (i, I) which is 
not stable (i.e. the advertiser i could profitably deviate locally to this edge from another edge(s)), a reverse 
greedy approach would take budgets from edge with minimum marginal payoff and put it to until 
(i, I) becomes stable, again via moving from partition to partition. Since we always move from partition to 
partition and a partition is visited at most once in each of the above two phases, and there are at most O(N) 
partitions per keyword, this algorithm is efficient. 

Theorem 3 There is a strongly polynomial time algorithm for local AdBRP. 

Proof: The proof follows from the Algorithm l5.2l provided in the following. The Algorithm 15 . 2 1 first parti- 
tions the queries across various keywords so that the payoff, cost, and marginal payoff of i is the same for 
all queries in a given partition. This is computed via Algorithm 15. 1 1 which takes 0(MN 2 K) time. Initial- 
ization takes 0(M) time. In the GREEDY ALLOCATION PHASE, the algorithm greedily distributes the 
budgets across various keywords moving from one partition to another. Each iteration of the while loop 
in this phase takes 0(M) time and since there are O(MN) partitions the total time taken for this loop is 
0(M 2 N). GREEDY READJUSTMENT PHASE first checks if there is any edge (i, I) which is not stable i.e. 
the advertiser i could profitably deviate locally to this edge from another edge(s). It is not hard to see that at 
the end of GREEDY ALLOCATION PHASE, there can be at most one such edge (ref. Lemma|U>. If there is 
such an edge, this phase adjusts the budget to make this edge stable without making any other edge unstable. 
This is achieved via a reverse greedy approach taking budgets from edge with minimum marginal payoff and 
putting to (i, I) until (£, I) becomes stable, again via moving from partition to partition. Thus, Algorithm \5.2\ 
correctly computes a locally best response for advertiser i. The while loop in this readjustment phase also 
terminates in O(MN) iterations and therefore this phase takes total of 0(M 2 N) time. Hence, the running 
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time of Algorithm El is 0(MN 2 K + M 2 N) i.e. strongly polynomial time 
Algorithm 5.2: Greedy Budget Splitting(G, B n j\/n G INT - {i},j G 
Ji = {jeM:(i,j)eE} 

comment: Computing A, z, C, U, H values for all relevant j's 

for each j G J; 

do Query Partition (G,j, B n j\/n G INT - {i}) 

comment: Initialization 

Xj <- Vj G Ji 

Cij <- 0, B Lj «- 0, <- Vj G Ji 
Ci<-0 

comment: GREEDY ALLOCATION PHASE 

while Ji ^ 4> 

' L <- ar# max.,- eJi n(i,;', A,) 
if there are more than one such index take the minimum one 

if Ci + y L C(i,L,X L ) > B t 



then < 



y <- 


1 Bi-Ci | 
Lc(i,L,Az,)J 




«- + y 


Bi,L 


<— -Bi,L + (-Bj — Ci) 




«- C l:L + yC{i,L,X L 




-Ci + yC{i,L,X L ) 


exit 





do < 



comment: Exiting the while LOOP 

(B hL ^ B hL + y L C(i,L,X L ) 
C i;L <- C i:L + y L C(i,L,X L ) 
C l <- Ci + y L C{i,L,X L ) 



else < 



if A; 



A, 



1 



comment: i.e. no more queries for L is left 



then 
else 



J, <- J, - {L} 
X L «- X L + 1 

X L <- A L + 1 



comment: GREEDY READJUSTMENT PHASE 
Ji <- {J : (*,j) G £and > 0} 



I^{Iel:(M)e£ and 3j G Ji — {Z} s.t. MP+ > MP^} 
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comment: there can be at most one such I (ref. Lemma[4]) 



if such an I does not exist 

then return ({Vij}, {Bij}) 
comment: Readjustment not required 



while Xi < A t and MP+ > mm jeJt _ {l} MPr. 
'j «- arg min jeJi _ {l} MPr- 

(Vi,j-z j>x ._ 1 )C(i,j,\ j -l)+(B iij -Ci, j )+(B iil -C i , l ) 
C(i,l,\i) 



if y < zi t \ l+ i - v it i 

' B it i <- B it i + (Vij - z j) \.- 1 )C(i,j, Xj - 1) 
~\~{Bi y j — Cij) 

<- Qj - (Vij - Zj t Xj-i)C(i, j, Xj - 1) 
Bij ^ 



then < 



C it i <- Q,/ + yC(i,l,Xi) 

Aj *~ Aj ~ 1 
4 '<,; <~ -j.A, 



do < 



else < 



iSV itj = 
then Ji <- - {j} 



(^,A, + i-K,i)C(i,^Ai)-(Bi.j-gi,j)-(^.i-Q,i) 
C(i,i,A,-l) 



Ci,j Cjj ~~ yC(i,j, Xj — l) 

Ci,l «- Ci,i + (*j,a,+i - ^,j)C(», Z, Xi) 
Bij <— Cj,; 



then < 



if y = Vij - Zj^-i 
Aj Aj - 1 

XV i;j = Q 

k then Ji^ Ji- {j} 
else <- Vij - y 

A; «- A, + 1 



output ({V id },{B id }) 
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Lemma 4 At the end of GREEDY ALLOCATION PHASE of Algorithm\5J\ there can be at most one 
unstable edge, that is 

| U € M: € land 3j s.t.(i,j) € E,V itj >0&MP+ > MPrX | < 1. 

Proof: We will provide a proof by contradiction. If possible, let there be two unstable edges namely (i, I) 
and (i, m), thus there exist an edge G £ with > such that MP+ t > MP~- and MP+ m > MP~-. 
Note that there could be many such choices of j, let us choose the one with the minimum value of MP7~-. 
Thus, in the Figure|2j we are given that 7Ti < min {^3, 7rs}. Since the greedy allocation phase fills up ( or 
selects) the partitions with higher values first, starting the markers on the first partition of all the keywords, 
there must be partitions with values 7^ < tt\ and 7r4 < tt\ as shown in Figure |2j otherwise all partitions 
of (i, I) before and with the value 713 must have been filled/selected before the partition with value tt\ and 
similarly for all partitions of (i, m) before and with the value -K<j. WLog let the partition with the value 7r2 is 
filled before that with ^4. Therefore, 7r3 < tt^ otherwise the open partition ( i.e. the one not yet completely 
filled ) with value 113 would have been filled before Further, by using 712 < ni and tt\ < 713 we get 
7T2 < 7T3 and consequently 7T2 < ^4. As before we can again argue that there exist a partition with value 
7T6 < 7T2 but we must also have it?, < ttq otherwise the open partition with value 113 would have been filled 
before -kq. But this implies K3 < 112 which is a contradiction. I. 

MP^ = it 1 < min{7r 3 ,7r 5 } 



7T2 



MP? 



7T3 > 7T 2 



7T4 



7T 5 



Figure 2: There can not be two unstable edges at the end of GREEDY ALLOCATION PHASE in Algorithm 
15.21 The 7r values shown in the figure are the marginal payoffs of i in the respective partitions. ". . . " shown 
between two partitions indicate that there could be several or no partitions between these partitions. 

One might also like to consider another natural strongly polynomial time greedy algorithm based on 
the algorithm for fractional knapsack problem, considering each keyword as an item, total payoff (from all 
the queries that can be bought within the budget constraint) as the value and the corresponding total cost 

as the size, sorting the keywords by effective marginal payoffs i.e. ^l^f cost^ ' an( * § reecm y selecting the 
keywords until budget is exhausted. However, it can be shown (Figure^ that this greedy approach does not 
always lead to a locally best response of an advertiser i.e. there are examples where the solution given by 
this algorithm is not stable and the advertiser can improve her payoff by local deviation. Further, it is not 
clear whether some readjustment procedure as in the GREEDY READJUSTMENT PHASE of the Algorithm 
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15.21 could be applied to make such solutions stable. In this paper, we have not explored this direction in 
details and it might be interesting to make this algorithm work, if possible, by some suitable readjustment, 
and compare its performance with to that of Algorithm 15.21 For an initial insight, first note that, in the 
example given in Figure [3 the marginal payoffs for a keyword are increasing with the partition number i.e. 
U(i, 2, 1) > U(i, 2, 0). The effective marginal payoff for keyword 2 is 4 -^I^ 5 > 2 when Vi > 7 and 
this algorithms therefore spends all budget on keyword 2, giving a payoff better than Algorithm 15.21 Also, 
it is a stable solution because MP~ 2 = 3 > 2 = MP^ . In fact, in this particular example it is the global 
optimum. In general, clearly when this algorithm returns a stable solution without a need for readjustment, 
it always chooses a better budget splitting than Algorithm 15 .21 however as we give an example below in the 
Figure 01 it might not always return a stable solution. 



2l,0 



zi,i = Vi 



tf(i,l,0) = 4 
C(i, 1,0) = 2 
II(i,l,0) = 2 



Z2.0 = Z 2 .\ =Vl + l Z2.2 

U(i,2,0) = 1.5 U(i,2,l) = 3 

C(i,2,0) = l C(i,2,l) = l 

II(i,2,0) = 1.5 n(i,2,l)=3 



V 2 



Figure 3: Let Bi = 2(Vi - 1) and V 2 > 2V\ - 2. Algorithm El outputs (B iA = B h B ij2 = 0,^ = 
V\ — 1, Vi t 2 = 0) with total payoff of 4Vj. — 4 to the advertiser. However, the splitting {B^\ = 0, Bi^ = 
B u V iA = 0, V L;2 = 2Vi - 2) gives a total payoff of 4.5Vi - 7.5 > W x - 4 when V x > 7. 
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U(i,l,0) = 3.36 
C(i,l,0) = 0.8 
II(i,l,0) = 4.2 



20 



U(i,2,0)=2 
C(i,2,0) = 0.4 
n(«,2,0) = 5 



10 30 

U(i,2,l) = 3.2 
C(i,2,l) = 0.8 
II(i,2,l) = 4 



Figure 4: Greedy fractional knapsack algorithm may not return a stable solution. B{ = 12. This algorithm 
allocates all the budget to the keyword 2 as the effective marginal payoffs is q^io+o^io = f§ wn i c h 
is greater than that of keyword 1 which is q^ 6 ^ 5 = ^jr- However, the edge (i, 1) is now unstable as 
MP^ = 4.2 > 4 = MPr 2 . 
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5.3 Approximate Nash Equilibrium(e-NE) 



Let us first define the approximate Nash equilibrium for our setting as follows. 

Definition 5 Given a BMG G = (N, M, £), an e-NE for G is defined as the matrix of query values {Vi,j} 
and the budget splitting values {-Bij} such that for alii, YLu j)eE ^(*' ^*>i) — (1 — e ) S(i j)g£ Vij) 
for all alternative strategy choices {V(j} of advertiser i that satisfies the constraints in Equation® 

Now given the {V^j} and {B n j} values for all n £ INT — {i}, the approximate best response problem ( 
e-AdBRP) for i is to compute a set of {Vij} and {Bij} values satisfying the conditions in the above defini- 
tion of e-NE. 



As we mentioned earlier, the AdBRP is a variant of knapsack problem and we also know that the later 
can be approximated very well. Further, if we expect to get an FPTAS for AdBRP, we would expect that 
it also admits a pseudo-polynomial time algorithm (i.e. polynomial in N, M, K, V/s, and B^s) as well 
ll23l [Toll . Furthermore, all known pseudo-polynomial time algorithms for A^P-hard problems are based on 
dynamic programming ||23l [Toll . Therefore, naturally we try a similar approach. Perhaps not surprisingly, 
we design a pseudo-polynomial time algorithm for AdBRP. However, unlike the case of standard knapsack 
problemslEl [lOl [23] [16), this algorithm does not immediately give a FPTAS due to difficulty in handling 
the volume of queries and because for a given keyword (i.e. the item) queries may have different costs 
and utilities (i.e. all units of an item are not equivalent in cost and value) etc. Nevertheless, by judiciously 
utilizing the properties of the optimum and a double-layered approximation we will indeed present a FPTAS 
for AdBRP (Theorem©. We will devote the Sections [537Tl[53T2l[5331 on developing a FPTAS for AdBRP. 



Theorem 6 There is a FPTAS for AdBRP 



Thus, e-NE is indeed another reasonable solution concept for the Broad-Match Game. 



5.3.1 A Pseudo Polynomial Time Algorithm for AdBRP 

Without loss of generality, first let us assume that all relevant parameters such as budget, utilities, and costs 
are integer valued. This can be achieved by suitable approximation to rational numbers and then by appro- 
priate multiplication factors, without significant increase in instance size. 



Let P be the maximum total utility that the advertiser i can derive from a single keyword i.e. from 
any one of the j £ M while respecting her budget constraint. Therefore, MP is a trivial upperbound on 
the total utility that can be achieved by any solution (i.e. any choice of feasible Xj/s). For j G M and 
p = 1, 2, . . . , MP let us define, 



A(j,p) 



Min Y^i=iC{i,Ux 



U) 



0<x it i<Vi fori = 1,2,..., j 
x ijt eZ for I = 1,2, . . . , j 

oo if no solution to the above minimization problem exists 



(7) 



Clearly, the values A(l,p) for all p = 1, 2, . . . , MP can each be efficiently computed ( in 0{Nlog{V\)) 
time) by utilizing the partition structure as per Lemma [TJ and the Equations [4] and [5] The term log(V\) 
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instead of V\ comes from the the fact that it is enough to perform a binary search inside a partition for 
an appropriate Xi\. Therefore, the values A(l,p) for all p = 1,2,..., MP can be together computed in 
0{Nlog{Vx)MP) time. Let V = T, jE M V j- Now ' A U>P) for all j = 2, 3, . . . , M and p = 1,2,..., MP 
can be computed together in 0(M 2 PNV) using following recurrence relation. Further, for each choice of 
j = 1, 2, 3, . . . , M and p = 1,2,..., MP, we also compute and store the set of Xi,/s values that achieves 
the value of A(j, p), and this task can be performed in same order of time complexity. 



A(j + l,p) 



Min Ya=i C(i,l,Xij) + C(i,j + l,Xij +1 ) 

S - L Y%=iU(i,l,Xij) + U{i,j + l,x iJ+1 ) > p 
0<x it i<Vi for/ = l,2,...,i + l 
s f ,l€Z for/ = l,2,...,j + l 

oo if no solution to the above minimization problem exists 



mm 

x iJ+1 e{o,l^,-,Vj+i}-U(i,j+hxi,j+i)<p 



{C(i,j + l,x iJ+1 ) + A(j,p - U(i,j + l,x i)j+1 ^ 



After computing the A(j,p)'s, we can obtain the solution of AdBRP and the optimum value via 

OPT = max Mp {p : A(M,p) < . (9) 

Thus, we have a pseudo-polynomial time algorithm for AdBRP. 
5.3.2 A Pseudo Polynomial Time Approximation Scheme for AdBRP 

Now, we will convert the pseudo polytime algorithm of Section l5.3.1l to an approximation scheme by appro- 
priately rounding the payoffs of the advertisers. Let us call this scheme AS1. 

For a given e > 0, let T = |? and let us round the utility functions from U(i, I, x^i) to U' (i, I, Xij) = 

L^%^J. Now, let us use the above dynamic programming algorithm by replacing the upper bound on 
utility P to L^J and utilities U {i, I, x^i) to U (i, I, x^i). Let us denote the solution returned as V[ ■ i.e. the 
optimum of the problem after above rounding is attained at Xij = V-j,j £ M. Correspondingly, let the 
optimal solution without any rounding is Vij, j G M. Note that the values {T^j} are still feasible for the 
rounded problem. 

Now since {V-j} is optimal for the rounded problem we have 

jeM jeJvt 



> 2^ - J - 1 

jeJVt \ 



T 



/ M < 10 > 
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Further, using the inequality \\0\ we can get 



jeM jeM 

> ^2u(i,j,V id )-TM 
jeM 

jeM 

> (i-6)^t/(i,j,y 

jeM 

where the last inequality is implied by the fact that the optimal value without rounding will at least be the 
maximum derived from a single keyword i.e. EjeM^iii^ij) — Therefore, the {V^ •} gives an 
(1 — e) approximation to the advertiser i's best response. The time complexity of the dynamic programming 
applied to obtain the solution {V-j} is 0(M 2 [f \NV) = 0(M 2 N([^f\)V) which is polynomial in N, 
M, V and -. Note that this scheme is still pseudo polynomial due to appearance of V. This terms appears 
because when we build up the dynamic programming table we need to use the Equation [8] Nevertheless, 
this algorithm will be helpful in designing our FPTAS. 



5.3.3 FPTAS for AdBRP 

Now using the approximation scheme AS1 and by judiciously truncating set of possible query values we 
will present an FPTAS for AdBRP. Let us first note that for each keyword only the number of queries, 
starting the first one, that can be bought under budget constraint Bi are feasible. Therefore, for the ease of 
notation, without loss of generality, we can assume that Vj is infact the maximum number of queries that 
can be bought under budget constraint meaning all possible queries of a particular keyword are feasible if i 
wanted to spend her total budget on this keyword. Let V = YljeM Vj- Also, without loss of generality we 
have U(i,j, A) > for all A and j such that G £. 

Now, we perform a first layer of approximation by judiciously truncating the set of possible query values 
to obtain Lemma [8] Lemma |7] is just a warmup. 

Lemma 7 There is a feasible solution {Vij} of AdBRP with the total utility OPT satisfying the following 
properties where OPT is the total utility for the optimal solution of AdBRP: 

• At most for one j G M, Vij £ {z jfl , Zj,i,Zj,2, Zj,\j} 

• OPT > (1 - 5)OPT where 5 = rnax^- A 
Proof: 

Consider all the keywords j for which V^j £ {zj,o, Zj,i, Zj,2, • • • , z j,kj }• Now among these keywords, 
for the keywords with the maximum value of n(i,j, Vij), move the budget from one to the other un- 
less all keywords of such type except one satisfies the above condition, this moving around of budgets 
among the keywords with the same value of Tr(i,j, Vi.j) does not change the total utility. Thus, we can 
safely assume that there is a single keyword satisfying Vij £ {zj t o, Zj,i, Zj,2, ■ ■ ■ , z j,A 3 } with the maxi- 
mum value of ir(i,j, Vij) and say its I. Further, we can move budget from all other keywords satisfying 
Vij g {z jt o, z j: i, z j;2 , • • • , Zj^} to I until they satisfy Vj € {z jfi , z j; i, z jj2 , Zj^}, while losing at 
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most u(i, I, Vi : i). This is because while doing this reallocation we lose only when we do not have enough 
budget to move to buy the next query of 1. 1. 

Lemma 8 For every given parameter e < 1, there is a feasible solution {Vi j} of AdBRP with the total 
utility OPT satisfying the following properties where OPT is the total utility for the optimal solution of 
AdBRP: 

• OPT > (1 - e 2 )OPT 

• Vij E {Vjfi-, Uj,i,yj,2, ■ ■ ■ , Vj,nj} f or a tt 3 € M where the yj : , values are defined as follows: 



for each partition A = 1,2, ... ,Aj of the keyword j let — Zj t \-i = Q>\^z~\ + b where a and 
b are non-negative integers and b < \^~\ (recall that Zj \ — Zj,\-i is the size of partition X). Now 
divide the partition A in to min{a, 1}. p^-] +b sub-partitions, the first min{a, 1}. \^~\ sub -partitions 
being of size a queries each and the next b partitions each of size 1 query each. Define y^o = and 
yj,liUj,2, • • • ; Uj,nj as the end points of the sub-partitions created as above. 

Proof: Clearly, each partition is divided in to at most 2|~^f ] sub-partitions and since Aj = O(N) we obtain 
n 3 = 0{™). 

Let {Vij} be an optimal solution for AdBRP, then we find a solution {Vij} as follows: 

• if Vij G {yj,o, yj,i,Uj,2, • • • , Vj,nj} then Vij = Vij, therefore we do not lose anything in total utility 
coming from keyword j by moving from Vij to Vij. 

• if V it j <£ {y jfl , y jt i, yj )2 , yj, nj } then define V id to be the maximum value among {y jfi , yj,i,yj,2, • • • , yj, nj } 
which is smaller than Vij. Note that this case arises only when V{j comes from a sub-partition of 

size a > 1 and in that case there are at least \^~\ sub-partitions of size a with the same utility value 
u(i, j, Vij ) for each query in those sub-partitions. The maximum utility that we can lose by truncating 

Vj to Vtj is < a u(i,j, Vj) < (rf 1 a u(i,j, V id )) < j^OPT < ^OPT. 

Note that the way we have constructed {Vij}, it satisfies the budget constraint of the advertiser i 
because Vj < Vij and hence {Vij} is a feasible solution of AdBRP. Now, across all the keywords 
we can lose at most M jjOPT i.e. e 2 OPT by truncating the optimal solution {Vij} to the feasible 
solution {Vij}. I. 

Now we perform a second layer of approximation by applying AS1 with truncated set of possible query 
values as per Lemma[8] We present this approximation algorithm called AS2 in the following. 

1. Divide each partition of each keyword in to several sub-partitions as follows (same as in the proof of 
Lemma[U). For each partition A = 1, 2, . . . , Aj of the keyword j let Zj t \ — #j,a-1 = a + b where a 
and b are non-negative integers and b < ] (recall that Zj t \ — %,a-i i s the size of partition A). Now 
divide the partition A in to min{a, 1}. \^~\ + b sub-partitions, the first min{a, 1}. \^~\ sub-partitions 
being of size a queries each and the next b partitions each of size 1 query each. Define yj^ = and 
yj i, y 3 ,' 2, • • • , yj,n,j as the end points of the sub-partitions created as above. 

2. Apply AS1 by restricting the query values to take values only from the set {yj t o, yj t i, yj : 2, ■ ■ ■ , yj,nj} 
i.e. by changing the recurrence relation [8] to 

A(j+l,P)= min _ \C(i,j + l,Xij +1 )+A(j,p-U(i,j + l,Xij +1 ))\ 

xi,j+i£{yj : o,yj,i,yj,2,--;yj,n j }--U(ij+i,x itj+1 )<p L J 

(11) 
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and with the error parameter e, where i = i + 1 < -. 
The total utility of the solution returned by the algorithm AS2 is 

> (1 - e)(l - e 2 )OPT = (1 - t^)(1 - e 2 )OPT = (1 - e)OPT. 

The time complexity in constructing the sub-partitions is 0(J^-) and that of applying AS1 on the 

truncated set of query values {y h , y jtU y jf2 , %>i } is 0(M 2 7V( Lf J)^f) = 0(^^)asi = ±+1 < 
2 = O(i). Therefore, AS2 is indeed a FPTAS for AdBRP. 

5.4 To Broad-match or not to Broad-match 

In this section, we start out with a very important observation in the following theorem. 

Theorem 9 A BMG does not necessarily have a unique BME and the different BMEs can yield different 
revenues to the auctioneer. Moreover, in one of them the auctioneer loses while in the other one it gains in 
terms of revenue. 

The Theorem [9] leaves the auctioneer in a dilemma about whether he should broad-match or not. If he 
could somehow predict which choice of broad match lead to a revenue improvement for him and which not, 
he could potentially choose the ones leading to a revenue improvement. This brings forth one of the big 
questions left open in this paper, that is of efficiently computing a BME I e-NE, if one exists, given a choice 
of broad match. We plan to explore it in our future works. The proof of the above theorem follows from 
examples constructed in Figures [51 [6l [7] Further, in all the examples we take K = 2, 71 = 1, 72 = 0.7. 

We also note several other observations such as 

• introducing an edge may or may not shift the BME ( See Figures |8l9l) 

• introducing an edge can shift the BME to one yielding less revenue to the auctioneer , as well as, to 
one with less efficiency i.e. social welfare ( See Figure [I][9]) 

• introducing new edges can shift the BME to one yielding more revenue to the auctioneer (See Figures 

• unlike in Figure |9l an extension of a BMG may not have a BME where for all nodes the degree is the 
same as in the BME for the base BMG. See Figures [TJ |7J 

It is instructive to note that all the above observations {including Theorem |9]l continue to hold 
under the solution concept of e-NE. 
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Advertisers 



Keywords 




\ / 




Figure 5: A BMG (without dashed edges) and its extension (with dashed edges). The values shown along 
the edges are Sjj respectively. Further, note that it is a good broad-match as valuations Sij of an advertiser 
along new edges is greater than her valuation along old edges. 
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E = E Q + 1.4V 3 - 0.7Vi 



Figure 6: One BME for the extension of BMG in Figure [5] ( the values Vij at this BME are shown along 
corresponding edges). 

Not a pure Nash equilibrium but an e-NE for e > 0.15. 
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MP+ 2 = 1.67 

mp 5 : 3 = 4 




E = E + 0.7^ - 0.7V! 
Figure 7: Another BME for the extension of BMG in Figure [5] 
Also a pure Nash equilibrium, therefore an e-NE for e > 0. 
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Advertisers Keywords 



Ui = 4.1Fi 




Figure 8: No Shift in Equilibrium due to the new edge (3,1). The values shown along edge (i, j) is (sjj, Vij). 
U{ denote total payoff of advertiser i. 



Advertisers Keywords 




Figure 9: A Shift in Equilibrium due to the new edge (3,1). The values shown along edge is (sjj, Vi j). 
m denote total payoff of advertiser i. 
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6 Auctioneer Controlled Broad Match(AcBM) 



In this section we study the second broad match scenario i.e. AcBM as described in section|4] First, we need 
a few definitions. 

Definition 10 Excess Budget: Let G = (N, JVC, £) be a BMG and G' = (J\T, JVC, £ J Z?e arc extension (broad- 
match) of G. For an i G N, Zef Z?i denote the budget of i unspent in G and let s% = maxj g £ Sjj. Wfe raj ?/zotf 
i has an excess budget in G iff Si < Z)j. 

Intuitively, what we mean by an advertiser to have an excess budget is that she has enough amount currently 
left unspent so that she can participate in at least one query of any of the keywords she is currently bidding 
on, if given a chance, irrespective of the valuations of her competitors. 

Definition 11 Revenue Improving Broad-Match: An extension G' = (DM*, JVC, £ ) o/BMG G = (DM", JVC, £) 
is called revenue improving if there exist an allocation of the excess budgets along new edges (i.e. edges in 
£ \ £) so that the sum of total budget spent by all the advertisers is more in G than in G, as well as, there 
is a strongly polynomial time algorithm to find such an allocation of the excess budgets. 

Now, for a keyword j, let I(j, I) denote the set of advertisers having sufficient budget to participate in 
the Zth query of this keyword. The following lemma can be obtained in a way similar to the Lemma Q] and 
again utilizing the partition structure per this lemma, we obtain Observation [13] that as long as the quality 
of broad-match is good in some sense, the auctioneer can guarantee a better revenue for himself by suitably 
exploiting the extension. 

Lemma 12 Given the budget splitting of all the advertisers for the BMG G = (DM", JVC, £) (i.e. the cur- 
rent budget splitting ), that is the values Bi j's for all G £, there exist non-negative integers Aj, 

Zj,0i Zj,l, Zj,2i ■ ■ ■ j z j,kj' and sets Ij,i->Ij,2, ■ ■ ■ , Ij,Aj> Ij f or Q U 3 e -M-} sucn that A? = ^(-^0 and for 
all < A < Aj - 1, c(i,j,l) = C(i,j,X),I(j,l) = Vz jiA < I < z jtX +i where C(i,j,\) := 

c(i,j,Zj t \ + l),Ij j A+i := I{jiZj,\ + 1) and Ij = {i € JSf : G £, i has an excess budget in G}. 

Moreover, all these A, z, C, I values can be computed together in time polynomial in N, M and K. 

Observation 13 Let G' = (Jsf,JVC, £') be an extension of BMG G = (Jsf,JVC, £), I = U jeM Ij, Ji = 
{j G JVC : £ £' \ £ 1 for i € I, Tj = ii € JV : G £'}, and $ = {j £ JVC : = </>}. This 
extension is revenue improving if there is ani G I having one or more of the following properties/conditions: 

a) 3j G Ji \ $ such that Ij t K 4 = Ij and {m£ Ij^ : s m j > Si,j} | < K. 

b) 3j G Ji D $ such that \ {I G I fl Tj : s^j = Sjj} U {Z G Tj : s; j < Sjj} | > 1. 

c) 3j G Ji \ $ rac/i f/iaf /j^Aj 7^ Jj Sjj > max| e ;. A y. . 

Intuitively, the properties a) and c) says that there is an advertiser i with excess budget in G and a keyword 
j such that i has a good enough valuation for j in G so as to obtain a slot, and moreover if i is brought in 
for the last query, all advertisers already participating in that query have still enough budget to participate 
in that query. The condition b) says that there is a keyword j with unsold queries in G and we can sell 
these queries in G' to at least two advertisers^. Thus, these conditions allow the auctioneer to bring addi- 
tional advertiser(s) for the keyword j to fs last query or equivalently the very first query in the last partition 

13 Note that just selling to one advertiser does not generate any money in GSP. In practice, however Google/Yahoo! charges a 
minimum amount i.e. a reserve price for the last slot. Nevertheless, all the results we present remains unchanged by introducing 
reserve prices for the last slot. In that case, the condition b) will change to 3j £ Ji fl <&. 
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((zj a, i-i + l)th query) so that revenue extracted from this particular query is improved without changing 
the revenue generated from any other query. Therefore, in terms of the existence of a revenue improving 
extension we have the above observation. Nevertheless, in general the auctioneer can do much more than 
the above trivial way. In particular, there might be several i and j's satisfying the properties in Observation 
[T3l and he could exploit this profitably. Recall that he has a finer control over which query to start with 
and whatever part of excess budget he can spend along edges in £ \ £. So his task is to choose splitting 
of excess budgets along new edges as well as to decide a starting query number. In general, there could 
be O(N) advertisers with excess budgets and there could be O(MN) new edges, finding the best splitting 
is a variant of Integer Knapsack problem again and thus computationally hard. Nevertheless, it is clearly 
possible to design strongly polynomial time sub-optimal algorithm that does significantly better than the 
trivial improvement possible by increasing competition for the last query. The problem with participating 
starting a query in other partitions is that it may change the partition structure in a way that is not revenue 
improving, but since there are only O(N) such partitions for each keyword, we could check for each parti- 
tion whether starting with its first query is revenue improving or not. Indeed, it is easy to think of a strongly 
polynomial time algorithm that finds excess budget splittings which improves revenue, one that moves from 
partition to partition, taking starting query as the first query in that partition and then doing a binary search 
for appropriate budget (as high as possible) to be allocated and tracks which one of these possibilities lead 
to the highest revenue. Note that since we are doing a binary search on budget (and equivalently on the 
number of queries to participate in), we are still in strongly polynomial time regime. Finally, it should be 
interesting to search for efficient algorithms generating better revenue and in particular a FPTAS, possibly 
by efficiently searching for which query to start with along with efficiently searching for how many queries 
to participate for. 

Now, given that auctioneer's goal is primarily to improve revenue, we should also analyze what happens 
to the efficiency (i.e. social welfare), if the auctioneer implements such revenue improving broad-match. 
First, it is clear that if auctioneer's goal were to improve social welfare instead of revenue, he could cer- 
tainly do so in a way similar to the one discussed above for revenue improvement. Moreover, revenue and 
social welfare could infact be improved together if one of the conditions in Observation [13] holds and the 
proof is same as that for Observation [13] by bringing in appropriate advertiser(s) in the last query of an ap- 
propriate keyword. However, if auctioneer deviates from this trivial way of improving revenue, which infact 
he will do if his goal is to maximize revenue, there can often be a tradeoff with social welfare. For an explicit 
example of this tradeoff please refer to Figure [TO] Furthermore, even when the conditions in Observation \T3\ 
are NOT satisfied, there might still be a possibility of revenue improvement (ref. Figure [j}. 
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{1,2,3} 



{2,3} 



{3} 



«) I 1 1 1 

19 36 100 

R a = 19 x (0.3 x 3 + 2 x 1.4 x 2) + 17 x (0.3 x 2) = 80.5 > Rq 
E a = 19 x (5 + 0.7 x 3) + 17 x (3 + 0.7 x 2) + 64 x 2 = 337.7 < E 



{1,2} 



{2,3} 



b) I 1 1 

50 100 

R h = 50 x (0.3 x 3) + 50 x (0.3 x 2) = 75 > R 

E b = 50 x (5 + 0.7 x 3) + 50 x (3 + 0.7 x 2) = 575 > E 



Ro < R b < R a but E a < E < E b 



{1,2} {1,2,3} {2,3} {3} 

3 21 40 100 

R c = 3 x (0.3 x 3) + 18 x (0.3 x 3 + 2 x 0.7 x 2) + 19 x (0.3 x 2) = 80.7 

E c = 21 x (5 + 0.7 x 3) + 19 x (3 + 0.7 x 2) + 60 x 2 = 352.7 

Rc > Ra 

Figure 10: Tradeoff in revenue and social welfare, the base BMG does not include edge (3, 1), its extension 
does. The revenue and efficiency value is just for keyword 1, as these values do not change for keyword 
2 even in the extension. In a) and b) advertiser 3 is brought in for keyword 1 starting with first query 
in partition 1 and 2 respectively. Note that the choice of partition with better revenue is the one where 
efficiency decreases, c) shows that just starting witf^^he first query of a partition may not lead to optimal 
revenue, for example if 3 is started with 4th query then the revenue improvement is even better than a) or 
b). 



7 Future Directions 



We have initiated a study of broad-match, an interesting aspect of sponsored search advertising, and as com- 
mon to papers that initiate a new direction of study, this paper leaves out several important open problems 
that deserve theoretical investigation and analysis. We discuss some of these interesting problems in the 
following. 

Budget Splitting Games(BSG): Abstracting the settings in Broad-Match Game can provide us with a rich 
class of mutli-player games having compact representations. It should be interesting to study these games 
and to consider the budget splitting/allocation scenarios beyond the currently prevailing models for spon- 
sored search advertising, as well as, other interesting applications. Herebelow we provide an abstraction. 

An instance of BSG is given by (N, M, £, B, V, 0). N is the set of players, M is set of distinct type of 
indivisible items and (N, DVt, £) is a bi-partite graph between the players and items wherein an (i, j) £ £ iff 
the player i £ N is interested in buying the item j £ M (i.e. % have a positive valuation for j). B = {Bi} iG yj 
is the budget vector, B{ being the total budget of player i. V = {Vj}j^j^ * s trie volume vector, Vj being 
the total number of units of the item j for sale. Let |N| = N and |M| = M. is an oracle that takes 
as input £ £ and a set of budget values {B n> j} ne yj_u\ and output the set of values Aj = O(N), 

Zjfl = 0,Zj t i, Zj t 2, ■ ■ ■ ,Zj,Aj = Vj, and C(i,j,\),U(i,j,\) for all < A < Aj — 1, to be interpreted 
as follows: given that for all n £ IN" — {i}, the player n spends/allocates an amount B n j on the item j, 
C(i,j,X) and U(i,j,X) respectively are the cost and the utility for each unit of the item if the player i 
decides to buy z 3 -. \ < Vi j < ^,a+i units of the item j. Each player has access to the oracle 0. The strategy 
of a player i is to split her budget across various items that is to choose the values {-Bjj}je3vt:(i,j)e£ suc h 
that J2jeM-(i i)e£ -^m — ^ anc ^ correspondignly the number of units of the various items {Vij } Jg ]vt:(j,i)e£ 
with < Vj. The payoff of a player is the total utility it derives across all the items from their units she 
bought as per her chosen strategy. 

Note that this abstraction nicely captures the Broad-Match Game studied in the present paper wherein 
an item is a keyword and a query of a keyword corresponds to a unit. The oracle corresponds to the 
LemmaQ] Further, by varying the properties satisfied by the values C(i,j, A), U(i,j, A)'s we may obtain 
other interesting scenarios. 

Complexity of Auctioneer's Dilemma: The Theorem [9] leaves the auctioneer in a dilemma about whether 
he should broad-match or not. If he could somehow predict which choice of broad match lead to a revenue 
improvement for him and which not, he could potentially choose the ones leading to a revenue improvement. 
This brings forth one of the big questions left open in this paper, that is of efficiently computing a BME I 
e-NE, if one exists, given a choice of broad match, as well as, to efficiently decide whether a given broad 
match is revenue improving or not in the AdBM scenario. 

Existence of pure NE/e-NE/BME in Broad-Match Game: Despite some effort we have not been able to 
show that a Broad-Match Game always admits pure NE or even e-NE or BME. On the other hand we have 
also not been able to construct examples where they do not exist. The problem arises from the fact that the 
payoff funtions are discontinuous and not even quasi-concave. Therefore, the existing techniques or their 
simple extensions do not seem to work. This calls for developing new techniques for proving the existence 
of pure Nash equilibria and the complexity of deciding the existence in general, and for the Broad-Match 
Game and BSGs in particular. 

Quantifying the Effect of Broad-Match: As per various observations including Theorem [9] obtained in 
the Section [5^41 we know that the the revenue of the autioneer and the social welfare could very well degrade 
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by incorporating broad-match and it would be interesting to characterize the extent of such degradation. 

Total Click-Through-Rates as Payoff: It would be interesting to analyze the effect of broad-match in a 
scenario where the advertisers are interested in maximizing their total Click-Through-Rate across various 
keywords rather than their total utilities. Note that the budget splitting game arising in this sceneario indeed 
fits in our abtract model of BSG discussed earlier in this section. With the intuition developed from this paper, 
we believe that similar conclusions hold true even in the case of total Click-Through-Rates as payoffs. 

BME vs e-NE: As we argued in the paper, both the BME and e-NE are reasonable solution concepts 
for the Broad-Match Game. Further, we constructed examples where the both solution concepts coincide. 
Nevertheless, it would be nice to study these two concept comparatively at a finer level. For example, BME 
may not provide good approximation guarantee to the advertisers but it demands less shophistication level 
from them than that required for good approximation guarantee. 

Network Level Competition via Broad-Match Game: The framework provided in the present paper 
is easily applicable to the scenario where advertisers are trying to optimally split budgets across various 
keywords coming from several competing search engines. How does the revenue of one search engine gets 
effected by the fact that another competeting search engine offers broad-match? For example consider the 
BMG in the Figure |9l without the dashed edge (3, 1), and suppose that the keyword 1 is coming from the 
search engine 1 and the keyword 2 is coming from the serach engine 2. Now if the search engine 1 does 
broad-match and introduces the new edge (3, 1) then we can see that the revenue of the search engine 1 
increases whereas that of search engine 2 decreases. It should be interesting to further explore this direction. 

Formalizing a Notion of "Good" Broad-Match: In the analysis we have presented, we have not con- 
sidered costs incurred by the auctioneer in the short run due to uncertainty about the extended part of the 
BMG, although we can expect that as long as the quality of broad match being performed is good, such costs 
should be minimal. Nevertheless, formalizing a notion of good broad match should be interesting. Features 
such as the improvement in the relevance scores and conversion rates should be an essential ingredient of 
a good broad match. Further, the conditions noted in the Observation [13] also provides some sense of what 
should be the features of a good broad-match from the viewpoint of the auctioneer. 
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